Picture for Yujie Wei

Yujie Wei

Do All Individual Layers Help? An Empirical Study of Task-Interfering Layers in Vision-Language Models

Add code
Feb 01, 2026
Viaarxiv icon

Learning to Accelerate Vision-Language-Action Models through Adaptive Visual Token Caching

Add code
Jan 31, 2026
Viaarxiv icon

Inject Once Survive Later: Backdooring Vision-Language-Action Models to Persist Through Downstream Fine-tuning

Add code
Jan 31, 2026
Viaarxiv icon

Dynamic Differential Linear Attention: Enhancing Linear Diffusion Transformer for High-Quality Image Generation

Add code
Jan 20, 2026
Viaarxiv icon

ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning

Add code
Dec 11, 2025
Figure 1 for ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning
Figure 2 for ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning
Figure 3 for ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning
Figure 4 for ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning
Viaarxiv icon

Taming Consistency Distillation for Accelerated Human Image Animation

Add code
Apr 15, 2025
Viaarxiv icon

DreamRelation: Relation-Centric Video Customization

Add code
Mar 10, 2025
Viaarxiv icon

FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion

Add code
Dec 12, 2024
Figure 1 for FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion
Figure 2 for FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion
Figure 3 for FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion
Figure 4 for FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion
Viaarxiv icon

Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model

Add code
Nov 28, 2024
Figure 1 for Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model
Figure 2 for Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model
Figure 3 for Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model
Figure 4 for Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model
Viaarxiv icon

PersonalVideo: High ID-Fidelity Video Customization without Dynamic and Semantic Degradation

Add code
Nov 26, 2024
Figure 1 for PersonalVideo: High ID-Fidelity Video Customization without Dynamic and Semantic Degradation
Figure 2 for PersonalVideo: High ID-Fidelity Video Customization without Dynamic and Semantic Degradation
Figure 3 for PersonalVideo: High ID-Fidelity Video Customization without Dynamic and Semantic Degradation
Figure 4 for PersonalVideo: High ID-Fidelity Video Customization without Dynamic and Semantic Degradation
Viaarxiv icon